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I. INTRODUCTION 


The  U.S.  Navy's  impact  acceleration  research  program  being  conducted 
by  the  Naval  Aerospace  Medical  Research  Laboratory  (NAMRL)  Detachment  is 
engaged  in  experimentation  concerning  dynamic  response  of  the  human  and 
simian  head/neck  system  as  a function  of  anthropometric  and  acceleration 
parameters.  NAMRL  is  accumulating  data  on  dynamic  response  of  the 
head/neck  body  segment  for  both  humans  and  simians.  Research  on  the 
development  of  a model  of  injury  probability  based  primarily  on  information 
in  the  data  is  contained  in  three  Desmatics  technical  reports  [3,  4,  5]. 

The  development  of  a model  of  injury  that  utilizes  an  experimenter's  prior 
knowledge  about  Injury  probability  in  addition  to  the  sample  information 
may  allow  for  more  accurate  inferences  to  be  made  concerning  the  likelihood 
of  Injury  as  a function  of  dynamic  and  physical  variables. 

This  technical  report  describes  procedures  for  incorporating  prior 
knowledge  with  the  statistical  Information.  Because  this  report  is  an 
extension  of  the  inference  procedures  found  in  a previous  technical  report 
[2],  it  is  suggested  that  report  be  read  before  proceeding  further  with 
this  one.  The  logistic  models  and  simulated  data  used  in  the  present 
report  were  discussed  previously  [4].  Since  the  data  was  simulated,  the 
true  underlying  model  parameters  are  known.  This  allows  an  assessment  of 
the  accuracy  of  the  Inference  procedures  derived.  As  with  inferences 
previously  discussed  [2],  the  inference  of  foremost  importance  in  this 
study  is  the  prediction  of  threshold  levels  which  allow  for  only  a small, 
prespecified  chance  of  head/neck  injury. 


II.  BACKGROUND 


Many  times  a researcher  may  want  to  quantitatively  incorporate  his 
or  her  own  prior  knowledge  about  an  experiment  with  data  to  be  generated 
by  the  experiment.  This  can  effectively  be  accomplished  by  the  Bayesian 
approach  to  statistical  inference.  Through  this  approach,  the  experimenter 
also  determines  the  importance  his  prior  information  should  have  with 
respect  to  the  future  sample  information.  The  Bayesian  approach  uses 
subjective  information  before  the  experimental  data  is  gathered  to  help 
prevent  any  subjective  bias  that  could  result  due  to  observation  of  the 
data.  For  further  discussion  on  this  concept,  see  [6], 

Bayesian  inference  quantitatively  incorporates  prior  information  with 
sample  information  by  the  use  of  a prior  probability  distribution.  This 
distribution  can  be  used  to  quantify  the  experimenter's  prior  beliefs 
and  expert  knowledge  about  the  experiment.  Once  a prior  distribution 
is  chosen,  the  inference  procedure  follows  from  Bayes'  Theorem  in  a 
straightforward  manner.  The  problem  of  assessing  the  prior  distribution 
for  probability  of  injury  is  treated  in  the  following  paragraphs,  which 
discuss  how  to  effectively  combine  an  experimenter's  prior  knowledge  with 
the  sample  information  in  the  form  of  maximum  likelihood  estimates  obtained 
previously  [2]. 

The  assessment  of  a prior  distribution  for  the  parameter(s)  of 
Interest  must  be  done  in  such  a way  as  to  quantify  the  experimenter's 
knowledge  and  intuition  as  accurately  as  possible.  The  prior  distribution 
should  not  be  too  complex  and  should  be  experimentally  interpretable.  For 

-2- 


the  mult ipurameter  logistic  model,  it  is  unlikely  that  the  experimenter 
will  be  able  to  accurately  choose  a multivariate  prior  distribution 
for  0,  the  vector  of  coefficients.  However  for  a given  critical 
vector  of  experimental  conditions,  the  experimenter  should  have  an 
intuitive  idea  of  the  value  of  the  probability*  of  injury,  p(x^).  The 
Bayesian  inference  procedures  in  this  technical  report  employ  a prior 
distribution  for  p(x^)  with  xj  given.  If  p(£q)  Is  to  be  considered  the 
outcome  of  a random  variable,  p(x^),  then  it  is  reasonable  to  assume  that 
the  outcomes  of  p(x^)  should  be  distributed  smoothly  around  some  prior 
estimate  of  p(xq).  This  suggests  the  use  of  the  beta  family  of  prior  dis- 
tributions for  p(Xq)»  The  beta  probability  density  function  is 


g(p)  " 


liaLL 


f(r ' ) r(n*  - r’) 


(1  - p) 


- r 


1, 


where  T is  the  gamma  function  and  r*  and  n'  are  experimentally  interpretable 
parameters.  For  future  clarity  all  parameters  of  prior  distributions  will 
be  primed. 


1 

For  greater  claritv  In  algebraic  expressions,  this  probability  is 
denoted  by  a lower  case  p rather  than  an  upper  case  P,  as  in  previous 
reports  [2,  3,  4,  5], 


III.  ASSESSING  A PRIOR  DISTRIBUTION 


There  are  several  ways  to  choose  the  proper  parameters  for  the 
beta  prior.  For  example,  see  reference  [6].  This  paragraph  describes 
a method  that  is  both  intuitive  and  likely  to  lead  the  experimenter 
into  giving  the  proper  weight  to  hla  or  her  prior  information.  If  p(xq) 
has  a beta  distribution  with  parameters  r'  and  n' , then 

Elp(x<p)  * r'/o'. 

In  a binomial  experiment,  if  n Independent,  identical  trials  are  con- 
ducted and  r "successes"  are  observed,  then  the  estimate  of  p is  r/n. 

The  expression, 

ElpUg)]  - r'/n', 

has  the  interpretation  that  the  experimenter  feels  his  or  her  prior 
information  is  worth  n'  experimental  trials  and  that  in  n'  trials  at 
Xg,  r'  injuries  would  be  expected.  In  other  words,  the  experimenter 
feels  that  his  or  her  information  is  worth  n'  sample  data  observations 
and  expects  the  true  probability  of  injury  to  be  r'/n'  at  x^.  For 
example,  if  the  experimenter  feels,  in  a particular  case,  that  prior 
Information  is  worth  10  sample  observations  and  the  prior  estimate  of 
P(Xq)  is  .1,  then  a beta  prior  with  parameters  r'  - 1 and  n'  - 10  should 
be  used. 

Suppose  the  experiment  is  conducted  at  Xq,  N independent  times. 

Let  y be  the  number  of  injuries  in  the  N trials.  Then  y has  the  binomial 


probability  distribution.  Using  a beta  prior  for  p(Xg)  and  the  binomial 
distribution  for  y it  is  easy  to  compute  E[p(xg)|y],  called  the  posterior 
expectation  of  p(£q),  which  is  an  estimate  of  p(*q)  that  employs  both 
sample  information  and  prior  information.  Elpfegily]  is  computed  from 
the  posterior  probability  density  function,  which  in  turn  is  computed 
from  the  binomial  distribution  for  y and  the  beta  prior  distribution.  The 
fact  that  y has  a binomial  distribution  and  the  prior  density  is  beta 
implies  that  the  posterior  probability  density  for  p(£q)  is  beta  with 
parameters  r"  ■ y + r'  and  n”  ■ N + n'.  (See  (6].)  Throughout  this  report 
posterior  distribution  parameters  are  double  primed.  Thus, 

ElpCx^ly]  - r'7n". 

Of  course,  the  experiment  is  never  actually  repeated  at  Xq.  The 
sample  information  is  scattered  over  the  region  of  the  x-spaee.  Let  the 
sample  size  used  to  form  the  binary  regression  estimate,  p(x),  be  N.  Had 
Che  experiment  been  repeated  N times  at  x^,  more 
would  be  available  than  is  contained  in 

p(5q)  * U + exp(-x^))  *, 

where  6 is  the  maximum  likelihood  estimate  of 
on  estimation  of  p(Xq)  by  the  maximum  likelihood 
Thus,  the  variability  of  y/N  should  be  less  than 
experiment  were  in  actuality  repeated  N times  at 


Information  on  p(Xq) 


(For  further  discussion 
method  see  (2].) 
that  of  P(Xq)  if  the 

V 


The  posterior  distribution  could  be  approximated  by  substituting 
Np(Xg)  for  y in  the  expression  for  r".  However,  then  the  amount  of  sample 
information  would  be  over-represented  because  the  sample  information  in 
p(Xg)  is  less  than  that  in  y obtained  by  repeating  the  experiment  N times 
at  Xq.  To  adjust  for  this  over-representation  of  sample  information  at 
Xq,  a number,  N*,  can  be  found  such  that  the  sample  information  obtained 

by  replicating  the  experiment  N*  times  at  Xq  will  equal  the  sample  infor- 

A * 

mation  represented  by  p(Xg).  N in  this  technical  report  is  referred  to 
as  the  effective  sample  size. 

* * 

To  compute  such  a number,  N , find  an  N such  that 


Var (y/N  ) - VarlpCxg)]. 


Both  variances  cannot  be  found  exactly,  but  reasonable  estimates  can  be 


found  for  each.  By  expanding 


p(x)  - [1  + exp(-x^|)] 


in  a first  order  Taylor  series  about  and  then  substituting  for  j?, 

A 

an  approximate  estimate  for  Var[p(x)]  can  be  found.  Thus,  since 

A A 

where  £ denotes  the  estimated  covariance  matrix  of  j),  and 
Var  (y/N*)  ^ pO^Hl  - p(x{j)]/N*, 
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this  implies  that 


1 


N - <p (iio> ^ 1 ” p^))1«o^o)  i' 

* 

N represents  the  amount  of  sample  information  that  can  be  utilized  in 
a binary  regression  set  up  by  specifying  a prior  distribution  for  the 
probability  of  injury  only  at  x^- 

* 

Since  the  effective  sample  size  is  N and  since  y is  approximated 
by  N p(xq),  the  posterior  distribution  (which  is  beta  with  parameters 
r"  - y + r'  and  n"  ■ N*  + n’)  can  be  approximated  by  a beta  distribution 

A >N  A 

with  parameters  r"  - N p(xq)  + r'  and  n"  » N + n'.  Thus 

Elp(2So^ly]  “ p(*o^  + r')/(N  + n') 

is  the  Bayes  point  estimate  of  p(x^).  Through  Bayesian  theory,  it  can 
also  be  shown  that  E[p(x^)|y]  is  the  probability  of  a future  injury  at 
Xq  given  the  present  effective  sample  size  and  prior  information. 

A.  INTERVAL  ESTIMATION 


All  Bayesian  inferences  are  made  by  using  the  posterior  distribution. 


To  compute  a Bayesian  100(1  - ci)X  credible  interval*  for  p(Xq),  Cj  and 
must  be  found  such  that  Pr[c^  < p(x^)  £ c 2 1 y ] * 1 - a.  As  previously 
shown,  the  posterior  distribution  can  be  approximated  by  a beta  distri- 

if  * 

button  with  parameters  r"  - N p(x^)  + r'  and  n"  - N + n'.  Thus,  by- 
using  a beta  distribution  with  parameters  r"  and  n",  approximate  values 
for  and  mav  be  calculated.  Computation  of  c^  and  c,  involves 
evaluation  of  the  inverse  incomplete  beta  function.  This  function  is 


1 


tabled  In  lb],  for  example.  The  values  of  and  C2  can  also  be  computed 
from  percentiles  of  the  F-distribut ion,  as  described  in  [1].  Figure  1 
gives  examples  of  Bayesian  credible  intervals  and  their  corresponding 
maximum  likelihood  estimate  confidence  intervals  for  models  A and  B using 
sample  sizes  100  and  1000  for  each  model.  See  [4,  5]  for  a description 
and  discussion  of  these  models.  The  critical  Xq  vector  used  In  these 
examples  was: 

- (-0.0862,  -0.2621,  0.9114,  -0.7660,  -0.1363,  -0.4655). 

B.  HYPOTHESIS  TESTING 

There  is  no  standard  method  of  Bayesian  hypothesis  testing.  This 
technical  report  will  use  a Bayesian  hypothesis  testing  technique  that 
employs  the  ratio  of  posterior  probability  of  the  null  hypothesis  being 
true  to  the  posterior  probability  of  the  alternative  hypothesis  being 
true.  The  hypotheses  will  be  restricted  to  the  form 

V 1 po 

versus 

V p(V  < v 

To  test  the  null  hypothesis,  against  the  alternative  hypothesis,  H^, 

compute  the  posterior  probability  that  p(xq)  < Pq»  and  form  the  ratio 

PrIpUQ)  2 PolyJ/PrlpCx^,)  < pQ|y). 

This  represents  the  posterior  odds  ratio  of  Hq  to  Hj . Suppose  a criterion 
value  of  l)  is  used  for  decision-making.  If  the  odds  ratio  is  less  than 


MODEL 


Figure  1:  Examples  of  Bayesian  Credible  Intervals  and  Corresponding 
Maximum  Likelihood  Estimate  Confidence  Intervals 


V 


is.  / 


ft,  Hq  is  rejected;  if  not,  then  H^  cannot  be  rejected  based  on  the 
value  ft . The  probabilities  used  to  form  the  above  odds  ratio  can  be 
approximated  using  a beta  distribution  with  parameters  r"  ■ N p(Xq)  + r' 
and  n"  - N + n'.  Figure  2 contains  a chart  of  hypothesis  testing  examples 
using  models  A and  B with  sample  sizes  of  100  and  1000  for  each.  The 
critical  Xq  vector  used  in  these  examples  was: 

- (-0.7770,  -0.3436,  0.7804,  0.5683,  0.0975,  -0.5476). 

C.  CRITICAL  COMBINATIONS  OF  THE  VARIABLES 


An  inference  of  much  importance  in  this  study  is  the  assessment  of 
p(Xq),  the  true  probability  of  injury  for  a given  set  of  conditions  x^, 
in  relation  to  some  small  probability,  p^.  Such  inferences  can  be  made 
by  the  prediction  of  critical  envelopes  and  by  testing  hypotheses  concern- 
ing p(Xq)*  To  see  how  to  best  use  Bayesian  hypothesis  tests  for  making 
inferences  as  stated  above,  see  [2]  and  substitute  Bayesian  tests  for  the 
classical  procedures  given  there. 

A critical  envelope  can  be  defined  as  the  set  of  all  combinations 
of  independent  variables  for  which  the  predicted  probability  of  injury 
equals  some  given  value.  However,  in  this  case  variability  in  the  pre- 
dicted probability  causes  variability  in  the  prediction  of  the  critical 
envelope.  To  predict  safer  critical  envelopes,  such  variability  must 
be  accounted  for.  As  shown  in  [2],  a safer  critical  envelope  can  be 
formed  from  the  set  of  x vectors  such  that  the  upper  end  point  of  a 
100(1  - ct)X  right-sided  confidence  interval  for  p(x)  equals  p^.  A 
Bayesian  analogy  of  this  is  formed  by  using  Bayesian  credible  intervals 
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* 


Figure  Some  Hypothesis  Testing  Examples 


to  Incorporate  sample  and  prior  information  into  the  prediction  of  a 
critical  envelope. 


An  exact  Bayesian  critical  envelope  requires  prior  estimates  of  r' 
and  n'  for  each  x^.  These  estimates  are  likely  to  be  difficult  to  acquire. 

However,  it  is  reasonable  to  assume  that  the  prior  distribution  for  p(x) 

Is  the  same  for  all  x's  in  the  critical  envelope.  A practical  choice  for 
a prior  distribution  is  a beta  distribution  with  parameters  r'  and  n'  such 
that  (r'/n')  ■ p^.  The  value  of  n'  can  be  chosen  to  represent  the  strength 
of  the  experimenter's  prior  convictions.  For  example,  if  the  experimenter 
feels  that  his  or  her  prior  information  is  worth  ten  sample  data  observe- 

j <j 

tlons,  then  the  choice  would  be  n'  - 10. 

To  compute  a right-sided  Bayesian  100(1  - a)X  credible  interval  for 
p(x),  a number  c must  be  found  such  that 

Pr[0  < p(x)  _<  c|y]  - 1 - C». 

Here  p(x)  has  a beta  distribution  with  parameters  r"  and  n"  which,  as 
defined  previously.  Incorporate  both  the  prior  and  sample  information.  To 
construct  a Bayesian  critical  envelope,  all  x/s  which  satisfy  c - Pq  must 
be  found.  Methods  of  numerical  analysis  are  required  since  c,  as  a function 
of  x,  has  no  closed  functional  form. 

Although  there  is  no  closed  functional  form  for  the  equation  describing 
the  Bayesian  critical  envelope,  it  is  possible,  using  a digital  computer, 
to  find  an  approximate  closed  form  for  the  Bayesian  critical  envelope.  To 
do  this,  first  store  (in  core  or  on  data  records)  a grid  of  points  roughly 
spanning  the  jc-space.  For  each  point  on  the  grid,  compute  the  corresponding 
upper  end  point  of  the  right-sided  100(1  - a)X  Bayesian  credible  interval. 

Then  use  multiple  regression  to  find  a best  fitting  prediction  equation 

j 

J 
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for  the  credible  interval  end  point.  Let  pu(x)  be  the  upper  end  point 
of  the  right-sided  100(1  - a)X  credible  interval  for  p(x).  Then  the 
approximate  equation 

m 

p ( x ) - b (a)  + T.  b (a)x  (x) 
u 0 i-l  1 1 

results  from  the  regression  analysis,  where  b^(a),  i - 0,  l,...,m  are 
the  regression  coefficients  and  where  *j(x)  is  the  i-^1  term  of  the  best 
fitting  multiple  regression  equation.  Notice  that  because  Pu(x)  depends 
upon  a,  the  b^'s  must  also  depend  upon  a.  Once  the  regression  equation 
has  been  found,  set  py  - p^.  An  approximate  Bayesian  critical  envelope 
is  then  given  by 

m 

ctt(P0)  " fx:P0  “ bo<a>  + s hi(a)*1(x)}. 

This  approximate  critical  envelope  yields  a simple  closed  functional  form 
and  is  thus  useful  for  theoretical  purposes. 

A revealing  way  to  study  a critical  envelope  is  to  make  2-diroensional 
plots  of  one  of  the  x^  variables  versus  one  of  the  other  x^  variables  with 
Pg  and  the  other  x^  variables  fixed  at  given  values.  This  was  done  in  a 
previous  Desmatlcs  technical  report  (2)  for  the  classical  inference  situa- 
tion. The  approximate  or  exact  Bayesian  critical  envelope  may  be  used 
to  create  these  plots.  For  example,  with  p^  and  x^,...,x^  satisfy  the 
critical  envelope  equation.  Then  it  can  be  graphically  seen  how  x^  and  X2 
should  vary  to  keep  the  probability  of  injury  below  p^  for  fixed  x^,...,x^. 
Obviously  to  plot  x^  versus  x^  one  must  assign  values  to  x^,  say,  and  then 
solve  for  x^.  For  the  exact  Bayesian  critical  envelope  a numerical  analysis 
computer  routine  would  be  needed  to  do  this  since  no  closed  functional  form 


exists.  The  approximate  Bayesian  critical  envelope  may  also  require  the 
use  of  a numerical  analysis  routine  depending  upon  the  complexity  of  the 
resulting  multiple  regression  equation. 


V.  SUMMARY  AND  DISCUSSION 


The  incorporation  of  prior  information,  through  the  use  of  Bayesian 
techniques,  may  improve  upon  the  inference  procedures  described  In  a 
previous  technical  report  [2],  It  should  be  recognized  though,  that 
large  differences  between  the  Bayesian  and  maximum  likelihood  procedures 
may  be  due  to  bad  sample  Information  or  gross  mlsspecif ication  of  prior 
information.  Thus,  if  large  differences  occur,  both  information  sources 
should  be  rechecked  thoroughly.  Further  Improvement  in  inference  procedures 
may  possibly  be  achieved  by  utilizing  preinjury  data.  This  is  briefly 
discussed  in  [3]. 

In  constructing  inference  procedures  for  the  assessment  of  injury 
probability,  all  available  information  should  be  used.  The  utilization 
of  more  sample  information  can  be  based  on  observed  preinjury  data.  As 
shown  in  [3],  preinjury  data  should  increase  the  accuracy  of  the  logistic 
regression  coefficient  estimates.  This  aspect  of  the  injury  probability 
prediction  research  is  extremely  important,  since  it  is  imperative  that 
the  most  useful  information  be  gained  without  subjecting  human  subjects 
to  injury.  Research  on  methods  of  employing  preinjury  data  is  now  under 
way. 
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